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Abstract 

We prove a one-shot “minimax” converse bound for quantum channel coding assisted by positive 
partial transpose channels between sender and receiver. The bound is similar in spirit to the converse by 
Polyanskiy, Poor, and Verdu [IEEE Trans. Info. Theory 56, 2307-2359 (2010)] for classical channel cod¬ 
ing, and also enjoys the saddle point property enabling the order of optimizations to be interchanged. 
Equivalently, the bound can be formulated as a semidefinite program satisfying strong duality. The 
convex nature of the bound implies channel symmetries can substantially simplify the optimization, 
enabling us to explicitly compute the finite blocklength behavior for several simple qubit channels. In 
particular, we find that finite blocklength converse statements for the classical erasure channel apply to 
the assisted quantum erasure channel, while bounds for the classical binary symmetric channel apply 
to both the assisted dephasing and depolarizing channels. This implies that these qubit channels inherit 
statements regarding the asymptotic limit of large blocklength, such as the strong converse or second- 
order converse rates, from their classical counterparts. Moreover, for the dephasing channel, the finite 
blocklength bounds are as tight as those for the classical binary symmetric channel, since coding for 
classical phase errors yields equivalently-performing unassisted quantum codes. 

This paper has been merged with [1]; see [2], 


1 Introduction 

The capacity of a noisy channel is the ultimate, in-principle limit on its capability for reliable communi¬ 
cation, and therefore studying channel capacity is an important goal in information theory. By its nature, 
the capacity is not of immediate practical concern, as it ignores the resource requirements that would 
be needed to achieve the limit. Approaching capacity might, in principle, require coding operations and 
blocklengths too cumbersome or large to be implementable. Nevertheless, several classical coding tech¬ 
niques developed in recent years have narrowed the gap between in-principle and in-practice for classical 
communication over classical channel, in particular polar codes [3] and spatially-coupled low-density 
parity-check codes [4]. Coding and decoding operations can be performed efficiently (quasilinearly) in 
the blocklengths of these codes, though the blocklengths themselves must still be rather large to approach 
capacity. The situation is dramatically different for quantum coding, where accurate control of quantum 
systems is a major experimental challenge, while manipulation and storage of classical bits is obscenely 
easy by comparison. 

Thus, it is of interest to better understand the possible performance of codes operating with fixed 
resources, in particular at finite blocklength. A bound which limits the performance of a coding scheme 
given fixed resources is known as a converse bound. For classical channels, the first truly systematic results 
on converse bounds limiting the size (blocklength) of codes with a given error probability were given by 
Polyanskiy, Poor, and Verdu [5]. They formulated the converse bound in terms of a minimax optimization, 
and showed by numerical examples that the bound is quite tight for several channels of interest even at 
small blocklengths, by comparing to existing and novel achievability bounds. Subsequently, Matthews 
[6] and Polyanskiy [7] demonstrated concavity and convexity properties of the bound which enable it 
to be formulated as a linear program (Matthews) or equivalently that the order of minimization and 
maximzation can be interchanged (Polyanskiy). Their results imply that channel symmetries can be used 
to simplify the optimization. 

For quantum channels, Matthews and Wehner extended the minimax approach to the task of trans¬ 
mitting classical information over quantum channels and formulated a bound in terms of a semidefinite 
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program [8]. Recently, Leung and Matthews gave a semidefinite program for the optimal achievable fi¬ 
delity for transmitting quantum information by codes of a fixed size [9]. In both of these cases the codes 
under consideration include the possibility of assistance by forward or reverse communication between 
sender and receiver, as this makes it possible to formulate the bounds as convex optimizations. 

In this paper we give a minimax bound for the size of codes for transmitting quantum information in 
terms of their entanglement fidelity. This is, in some sense, the opposite optimization as in [9], and follows 
the original approach of [5]. Here, too, assisting communication between sender and receiver is used to 
ensure the bound is tractable, and we show that the bound can be formulated as a semidefinite program. 
The advantage of this approach is that, as we show, simple qubit channels inherit converse bounds from 
simple classical channels, enabling us to directly apply results from [5] and [7] for classical channels to 
quantum problems. 

The remainder of the paper is organized as follows. The following section establishes the mathematical 
framework and notation used throughout. Then the precise details of the coding scenario under consider¬ 
ation and the minimax bound, Theorem 1, are presented in Sec. 3. Sec. 4 shows that the minimax bound 
can be expressed as a semidefinite program. Next, Sec. 5 is a somewhat elaborate discussion of how sym¬ 
metry can be used to simply the bound, which ultimately rests on its concavity and convexity properties. 
The particular qubit channel examples are detailed in Sec. 6, and the paper finishes with a discussion of 
related bounds. 

2 Mathematical Setup 

2.1 States and Channels 

In this paper we consider finite-dimensional quantum systems, labelled by capital letters A, B, and so forth. 
The state space of system A is denoted by #€ A , and the dimension of this space |A|. The set of bounded 
operators on .¥C A is denoted by 5$(A), while the set of bounded operators with unit trace, i.e. the states 
of A, is denoted by 3A{A). The maximally mixed state on A is denoted by n A = \ a /\A\. For two systems A 
and A 7 of the same dimension, i.e. with isomorphic state spaces, we may choose a basis {\k)} k in each to 
define the unnormalized vector lO)^/ = J ] k \k) A <S> \k) A i. The canonical maximally entangled state is then 
just 14*)^' = I^)a 4 '- 

A channel jY b \ a is a linear map from 3# (A) to 3A{B) which is both trace preserving and completely 
positive. Adjoints of channels and operators are denoted by *. The Choi representative (or Choi operator) 
of the channel is the bipartite operator N B \ A = {. ( / A ® .Ae|/0[fW]. The Choi representative of a channel 

satisfies N B ^ > 0 and TrgfA/g^] — and any bipartite operator N B ^ satisfying these conditions defines 
a valid channel via p A i —> Tr A [N B \ A p A ], a statement known as the Choi isomorphism [10], Note that trace 
non-increasing completely positive maps have Choi operators with Tr B [IVg^] < t A . 

In contrast, the Jamiolkowski representative of jV b \ a is the operator N B \ A = N B j^, where T A denotes 

the transpose of system A. Put differently, N B \ A — {J? A ^ <8> ®V B \ ^[H^,] [11]. An appealing property of the 
Jamiolkowski representative is that the action of the channel no longer involves the transpose and is just 
p A <—» Tr a[N b \ a P a ]. This formulation makes channel action look quite similar to marginalizing random 
variables in a classical probability distribution; for more on this point, see [12]. Further, the Jamiolkowski 
isomorphism is natural, in the sense that it does not require a choice of basis, since — lT^ lsym , 

and the projectors onto the symmetric and antisymmetric spaces of #€ A ® are basis-independent. 
However, we shall not make particular use of this fact here. 

Bipartite states p AB for which p AB > 0 (equivalently, p AB > 0) will be called positive partial transpose 
states, or PPT for short. A channel is PPT-preserving if a PPT input necessarily results in a PPT output. 
In [13], it is shown that PPT-preserving channels have PPT Choi states (see the discussion after Eq. 4.13). 

T T 

More directly, suppose that cp^g is a state with cp^ > 0 and Q>mm'\ab * s a channel with ^ > 0. Let 

Omm’ = and compute 

(1) 

(2) 
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Although the trace of positive operators is a positive number, the partial trace need not be a positive 
operator (consider Tr B [$ab^bc])> so we cannot conclude that cr^ M , > 0 on this basis alone. However, the 
fact that the (p^ factor is completely traced out, along with positivity of it and ■C , A ^/| AB > together imply 
that all expectation values of cr^ M , are positive. 

2.2 Semidefinite programming 

A semidefinite program (SDP) is simply an optimization of a linear function of a matrix or operator over 
a feasible set of inputs defined by positive semidefinite constraints. We give only the bare essentials here, 
for more detail see [14, 15]. 

The maximization form of an SDP is defined by a Hermiticity-preserving superoperator <S H \ A taking 
9% {A) to m(B), a constraint operator C e PA{B), and an operator K € PA {A) which defines the objective 
function. The SDP is the following optimization, which we will also refer to as the primal form, 

a = supremum Tr[Oi] 

subject to ^(X) < C, (3) 

X > 0. 

When the feasible set is empty, i.e. no X satisfy the constraints, we set a = —oo. 

The dual form arises as the optimal upper bound to the primal form, and takes the form 

/3 = infimum Tr[CT] 

subject to <S*(Y)>K, (4) 

Y > 0. 

Again, when the set of feasible Y is empty, /3 = oo. Weak duality is the statement that a < /3, that indeed 
the dual form gives upper bounds to the primal (or that the primal lower bounds the dual). 

Strong duality is the statement that the optimal upper bound equals the value of the primal problem, 
a = [3 . This state of affairs often holds in problems of interest, and can be established by either of the 
following Slater conditions. In the first, called strict primal feasibility, strong duality holds if [3 is finite and 
there exists an X > 0 such that &(X) < C. Contrariwise, under strict dual feasibility strong duality holds 
when a is finite and there exists a Y > 0 such that <§*(Y) > K. For strongly dual SDPs we also have the 
so-called complementary slackness conditions S*(Y)X = KX and S’(X)Y = CY that relate the primal and 
dual optimizers. 

Neyman-Pearson hypothesis testing is an SDP particularly useful in the study of information processing 
protocols. Given one of two states p or cr, called the null and alternate hypothesis, an asymmetric hypoth¬ 
esis test is a two-outcome POVM [T, 1 — T} that indicates which of the two hypotheses (states) is actually 
present. Here T indicates p. Any such test makes two kinds of errors, the type-I error in which the null 
hypothesis is erroneously rejected, and the type-II error in which the alternate hypothesis is erroneously 
rejected. The probabilities of type-I and type-II errors are just Tr[p(l — T)] and Tr[oT], respectively. Fix¬ 
ing the probability of type-I error to 1 — e, we may ask for the test with the optimal (minimal) type-II error, 
called the Neyman-Pearson test. The optimal POVM is specified by T with the smallest value of Tr[oT] 
such that 0 < T < 1 and Tr[pT] > e. This is a dual-form SDP with Y = T, C = cr, K = (—1, e) and 
g*(Y) = (— Y, Tr[pF]), and it satisfies strong duality. For future use, let us denote this optimal value by 
&(p,cr): 


/3 e (p, cr) := min Tr[Tcr] 

subject to Tr[Tp]>e (5) 

0 < T < 1 . 

The slackness conditions can be used to infer the form of the optimal test, recovering the Neyman-Pearson 
lemma of classical statistics. 
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3 Converse bound 


3.1 Coding scenario 

In this section we define the coding scenario precisely. Here we consider coding schemes for using a 
noisy quantum channel JV B \ A together with some auxiliary assistance channels to create a high-fidelity 
entangled state between sender Alice and receiver Bob. In particular, a PPT-assisted code denoted by 
^ = (Paa'b'> ®mm'|ba'b') consists of a PPT state ipaa'b' with Alice in possession of systems Ati and Bob B' 
as well as a PPT-preserving decoding operation @mm'|ba'b' which outputs system M to Alice and M' ~ M 
to Bob. By a slight abuse of notation, we also denote the dimension of system M by M. Note that the 
decoding operation might consist of several rounds of forward and backward communication between 
sender and receiver. 

The coding scheme proceeds by Alice and Bob creating the state ipMB' by means of a PPT channel, 
and Alice subsequently transmitting the A system to Bob via ,jV b \ a . Finally, they perform the operation 
®MM'jBA'B'> leaving them with a bipartite state in MM'. The resulting entanglement fidelity is defined by 

F{JF -, c €)\= Tr[$MM'®MM'\A'BB' ° ^bIaC^aa'b')] ■ ( 6 ) 

Finally, an (M,e) code for jV b \ a is a pair ^ = (<§a 4 'b'j ®mm'|ba'b') with output dimension M such that 
Fpk; ^) > 1 - e. 

As a side remark, when the initial state Pm'b' is separable and not only PPT, the coding scheme can 
be simplified without loss of generality. In particular, one can dispense with system B' and restrict to pure 
states Y’aa !. To see this, note that the fidelity is a convex combination of entanglement fidelities for the 
constitutent states cr^, ® in tp/j< B > = JT °" AA / ® V B n so it will certainly not decrease when shifting to 

the code ‘gj = (cr^, <E> rj J B „ 3 >mm'\ba'b ') with the largest entangelement fidelity. Further, as r) ] B is fixed, it can 
be absorbed into the decoding operation. Finally, the same argument implies that one can further modify 
cr^, to be a pure state. However, this argument does not necessarily go through in the general case when 
Paa'b' is PPT. 

3.2 PPT-preserving channels 

Instead of the actual channel J/’ B \ A , consider an arbitrary PPT-preserving channel M b \a- This produces a 
PPT state Ymm' = ®mm'|a'bb' ° ^b\a(Paa'b')- The following argument, due to Rains, then immediately 
implies 


M 


(7) 


T 1 

Lemma 1 (Rains [16]). For any subnormalized state Ymm' with > 0, Tr [$ M m'Ymm'~\ ^ 

Proof. The proof proceeds by observing that the partial transpose of the maximally entangled state is the 
swap operator, normalized by M. 


Tr[<W)MM'] = T^MM'i'] 

— ^ ^ ^MM' ^ I °°T r [Tj\^] 

1 

< - 

“ M ’ 


( 8 ) 

(9) 

( 10 ) 


The first inequality follows from the fact that = 7 -Ufiff and therefore 

n J MM M MM MM ~ 11 Ml l 

T- t MM '■ The second inequality is the subnormalization of Ymm'- 


oo ^mm' — 

□ 


3.3 Fidelity in terms of the Choi operator 

Now we show that the entanglement fidelity ( 6 ) can be expressed directly in terms of its Choi operator 
of the channel and without explicit reference to the systems MM'A'B'. To do so, it is actually convenient 
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to start with the Jamiolkowski representation of channel action. In this representation we can write the 
fidelity as 


= Tr mm' l^MM'^A'BB'l^MM'lA'BB'^Ai^BlA^AA'B']]] (12) 

= 1 r MM'ABA'B' [®MM'DmM'\A'BB'Nb\aVaA'B'~\ (13) 

= 1 r MM'ABA'B' [$MM'DmM'\A'BB'Vm' B 'N B 1/J’ (14) 


where, in the last equation, the order of <p and N is interchanged in accordance with the usual rules of 
transposition. 

Define the operator 


h-AB ~ Tr 


MM'A'B ' 


[*. 




(15) 


Since N B \ A 



, the fidelity can be expressed as a linear function of the Choi operator of the channel. 


F^;^) = Tr[A AB N BlA ], 

for the particular A^g defined by the code. An (M, e ) code will have a A^g which satisfies 


(16) 


TrfA^glVg^] — 1 — e - 

Any such operator A^g satisfies the following two simple properties 
Proposition 1. Any operator A ab defined as in (15) satisfies 

0 < A^g < <pJ ® lg, 

T ~ T 

Proof First regard <p A , , as the Jamiolkowski representative R A 'b'Ia = pA lo , 

Ai/l jD I f\r\ D 


(17) 


(18) 

of a channel R%a'b'\a- Then 


A-AB ^A'B'\A° ( 19 ) 

Now observe that S&a'b'\a is completely positive and trace descreasing: 

Ra'b'\a — Paa'b' — 6, (20) 

Tr A /g/[K A /g'| A ] = (pj < 1 A , (21) 

meaning its adjoint action is completely positive and subunital. Therefore, (18) follows from the fact that 

6 - ^MM' — 1mm'- 

□ 


These constraints hold for more than just PPT-assisted codes, as is easily demonstrated. Let S>mm'\ba'b' 
be the channel which creates an( l ignores (traces out) the input systems BA'B'. Then Dmm'\ba'b' = 

^MM'lfiA'B'- This leads to A ab = p A t B , which satisfies both constraints. 


3.4 Minimax converse bound 

It is now straightforward to derive the minimax converse bound. The idea is simple: Maximizing over 
PPT channels ^Z B \ A in the fidelity expression (16) and using (7) gives a lower bound on -R, i.e. an upper 
bound on M. However, the result depends on the details of the code via A^g. This dependence can be 
removed by minimizing A ab over the smallest conveniently-described set which certainly contains the A AFj 
associated with the code. Here, this is the set defined by the constraints in (17) and (18), for arbitrary 
subnormalized < p A , as these do not depend on the precise details of the coding operations. 

To state the minimax bound formally, first define the following sets 

ppt := {M B]A :M BlA > 0, > 0, Tr B [M B{A ] < t A } (22) 

v(jV ,b) := {{Pa’^-ab) : Pa — 0>Tr[(p A ] < 1,0 < A^g < p A t B ,Tr[A AB N B \ A ] > 1 — e}. (23) 


Then we have 
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Theorem 1. Any (M, e) PPT-assisted code satisfies 


min max Tr[A AR M R t A ] 


1 

< — 

“ M' 


For later convenience, let us define 


/O',*?) 


min max Tr[A AR M R \ A ], 


(24) 


(25) 


Before proceeding with the proof, observe that we can interchange the order of optimization in /, due to 
von Neumann’s minimax theorem [17], as the objective function is linear and both , e) and ppt are 
compact, convex sets. This gives 

Corollary 1. Any (M, e) PPT-assisted code satisfies 


max min Tr[A/ioM R | 4 ] 

M BlA & P T& A ,A AB )efW,e) 1 



Proof of Theorem 1 . Consider then the following function. 


/o(Q4b) SU P Tr[04gM B | A ], 

MbiaSppt 


(26) 


(27) 


defined on all bipartite operators {0 AB : 0 < 0 AB < t AB }. First note that the supremum is attained, since 
the objective function is a continuous function M >-» Tr[OM] on a compact set, ppt. Moreover, / 0 is 
continuous; specifically, it obeys 


I/o(Pab) -/o(Oab)I < |A| ||p^ - QabM!. (28) 

To see this, suppose that / 0 (0^) < /o(O^g), otherwise swap the two. Then |/o(0^g)—/ oCQab)| = / O (0^g) - 
/o(Qab)- Let M' m be the optimizer in f () {0' AH ). By the variational characterization of the trace norm, 

fo(O m) > Tr [M'^Ojm] (29) 

— TrfM^O^] — |A| \\0^b ~ QabI It (30) 

= MO^ b )-\A\\\0^ b -O ab \\ 1 . (31) 

For A ab defined from an (M, e) code as in (15), (7) implies fo(A AB ) < -L. Taking the infimum over 
(.<p A ,A AB ') e , e) gives a bound independent of the precise details of the code. Finally, again since 
f(i#, e) is convex and compact and f 0 is continuous, the infimum is attained. □ 


4 The minimax bound as a semidefinite program 


In this section we describe how to formulate the minimax bound as a semidefinite program satisfying 
strong duality. Doing so is straightforward: We simply use the dual of the inner optimization in (24) to 
obtain a minimization problem, or the dual the of the inner optimization in (26) to obtain a maximization 
problem. Ultimately we find the following 


Proposition 2. For any channel ,jY b \ a and 0 < e < 1 , 


/M/e) 


minimum Tr[^ A ] 
subject to E, a 1 b >Aab + 
(-Ta>-^ab) e 
Va) A ab> Lab> /a — 0 


maximum m( 1 — e) — n 
subject to M b]a e ppt, 

mN B \ A < M b \ a + , 

TraD/ts] < rcl A , 

m , UjRas > 0 


(32) 
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Proof. Let us take the former approach, dualize the inner optimization in (26), and then show that strong 
duality holds. Of course, strong duality must hold, as it is equivalent to the saddle point property, but we 
shall give a simple independent argument for strong duality based on Slater’s condition. 

Observe that / 0 is a semidefinite program, in particular, a primal problem as we have defined it, 
with X = M B | A , K = O^g, C = (0,1 A ), and <S(X) = (— X Ta ,Yt b [X]). Choosing for the dual variables 
Y = (r^g, £ A ), the dual of f 0 is 

/o(°ab) : = minimum Tr[£ A ] 

subject to f A t B > 0 AB + (33) 

r AB’ — 0 - 

Combining this with the outer optimization over e) gives the minimization program in (32). 

The equality statement is precisely strong duality of the primal and dual forms of the inner optimiza¬ 
tion. By Slater’s condition, strong duality holds if / 0 is finite and there exists a strictly feasible set of 
dual variables. Observe that / 0 (O AB ) < |A|, since for the optimal M B \ A we have /ofO^g) = Tr [M b \ a O ab ] < 
Tr[M B | A ] < Tr A lA = |A|. Here we have used the upper bounds 0 AB < l^g and Yr B [M B \ A ] < t A . Thus, the 
first condition is fulfilled. Meanwhile, T^g = I AB and f A = 31 A are a strictly feasible pair. Thus, f 0 = f 0 
over the domain of interest. 

To construct the maximization program, we simply dualize the minimzation program. In particular, 
e) is a dual-form semidefinite program in the variable Y = (i/? A , A^g, T^g,^) with C = (0,0,0,1 A ), 
K = (1 — e, —1,0,0), and 

£*(Y) = (TrflVg^A^g], — Tr[i/? A ], <p A lg — A^^lg — A^g — T^). (34) 

Choosing primal variables X = (m, n,R AB ,M AB ) leads to the maximization in (32). 

Equality again follows from Slater’s condition: / is finite by the minimax formulation (in particular 
the bound on / 0 used above), while a feasible choice of dual variables is given by M AB = R AB = 

Z\B\ 

n = 1, and m = 2 \a\\b\ ' The choice of m ensures the first constraint holds strictly, since any Choi operator 
of a trace-preserving map satisfies ||lVg| A |loo = |A|. □ 


No discussion of strong duality of semidefinite programs is complete until the complementary slackness 
conditions have been formulated. Often, these give considerable insight into the form and properties of 
the optimizing variables. First observe that 

OT = (~nl A + Trg [R^l mN BlA - M m - R m , -AfJ}, Tr B [M m ]f (35) 


Then the conditions are easy to read off from the form of C and K. They are 


Tr[¥? A ] = 1 


TrfA^gA/g^] = 1 — e, 

Pa^-ab = A-ab^-ab 

^a^b\a = (Aab + h^jMg^ 

^A = Trg[fT^](/> A 


Tr B [Mg [4] — f A , 

m ^B\A^-AB = (Mg| A + l? A g)A A g. 


(36) 

(37) 

(38) 

(39) 

(40) 

(41) 

(42) 

(43) 


5 Channel symmetry 

Symmetries of the channel can greatly simplify the calculation of the minimax bound. First let us state 
precisely what we mean by channel symmetries. Suppose G is a group, possibly a topological group, 
represented by operators U g on A and V,, on B. A channel J / B | A is covariant with respect to G when 

V g ^(-)V* = Jf{u s • Up Vg e G. (44) 
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We can write this as an invariance of the channel: 


^(•) = V*Jf{U g ■ U* g )V g Vg e G. (45) 

In terms of the Choi operator, the condition is simply 

(t/J) A ® (V*) B N BlA (U g ) A ® (V g ) B = Vg € G. (46) 

Thus, the Choi state is a fixed point when averaging over the action of the group. To enforce such averag¬ 
ing, introduce the superoperator ‘^g: 


»(Qu») - 


Mg) (t/J ) A ® (vpMuJ )* ® (v g ) B , 

Jg 


(47) 


where pi is the Haar measure of the group. Observe that C S* = C S, since taking the adjoint of the group 
elements just reparameterizes the group, sending g to g -1 . 

Due to the structure of the symmetrization ( 3, we have the following 


Proposition 3. Suppose (c/^A^g) e e). Then (^(</? A ), ^(A^g)) e f(«A4 e). Similarly, ^(M B | A ) e ppt 
for any M B ^ e ppt. 

Proof. Start with the latter claim, and let M B \ A = ^(M B | A ).The positivity condition in (22) holds for M B \ A 
since ( S is completely positive. For the trace condition, we have 


Tr B [M B | A ] — 


d/x(g) {U l s ) A Tr B [(Vg ) B M b|A () B ] ( U g )* 


< 

Jg 

-1 a 


dp(g) (U!/) a 1 a (U‘J 


g J A 


(48) 

(49) 

(50) 


For the partial transpose condition, note that for any operator 0 AB , 


O A = 
U AB 


d Kg) (u g ) A ® m B o%(u g T A ® 04) B , 


(51) 


- y 

which is the action of a slightly different, yet still-completely-positive, version of C S on 0 AH . Thus, M g is 

T 

positive if M g is. 

For the former claim, again let (p A = ‘Sff'p/f) and K m = i S^ 4 B (A ab ). Returning to (23), the positivity 
conditions (p A , A^g > 0 hold because ^ is completely positive.The two trace conditions still hold because 
oTjis G-covariant. It remains to show the upper bound on A^g: 


A ab - 


d Kg) 0/J)a® (Vpg(^ ® lg)(Gj)>(V g ) £ 
dpt(g) {Ul<p T U g ) A ® 1 B 


= ( 


d/i(g) U*tpUg)l® h 


= { Pa 1 


B • 


(52) 

(53) 

(54) 

(55) 

□ 


As shown by Polyanskiy for the classical metaconverse [7], we can now show that G-covariant quantum 
channels have G-invariant optimizers. Letting ppt g be the set of Choi operators of PPT-preserving channels 
which are invariant under c 3, and similarly F r; (,yK, e) the intersection of r(.yV, s) with G-invariant operators, 
we have 
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Theorem 2. For G-covariant channels ^V b \a> we can restrict the optimizations in (24) to ppt g and t G (jV B \ A , e ). 
Proof. The proof proceeds similarly to that of [7, Theorem 20]. To simplify notation, define 

gCOts) = (t £)a ® (Y^)bOab(U l f A ® (V g ) B . (56) 

Now consider the outer optimization in (24). First note that the function f 0 from (27) is convex, as it is 
the pointwise maximum of linear functions. Furthermore, / 0 must be constant on orbits of G. Suppose 
M' b a is the optimizer for g(0 Alj ) for some arbitrary g € G, so that f 0 {0 AB ) = Tr[M' A g(0 AB )]. It follows 
that g~ 1 (M / B | A ) is feasible for/ 0 (O^), since independent unitary operations on A and B are PPT-preserving. 
Hence, /oCQ^) > / 0 (gCQ^)). But the same argument implies / 0 (g(O^)) > / 0 (g _1 ° gCQ^)) = / 0 (O^). 
Applying Jensen’s inequality and taking the minimum over (p A , A^g) e f(A®, f) gives 


min 

(‘PA>44s) eF (4^ , e ) 


/o(^(Aab)) — 


min 


/o (A^g)- 


By Proposition 3, we can restrict the optimization on the lefthand side to F g {.jY, e) and obtain 


(57) 


min /o(Aab) —, min UA^). 


(58) 


On the other hand, since we are now minimizing over a smaller set, the lefthand side of this expression 
cannot be smaller than the right, so equality holds. 

Next consider the inner optimization in (24), with the additional restriction to f g (A®, e) in the outer 
optimization. For fo(A Alj ) with G-invariant A ab , the objective function does not change upon replacing 
M b \ a by ( ${M b \ A ). Proposition 3 then implies that we can safely restrict the optimization to ppt g . □ 


6 Examples 

6.1 Qubit dephasing channel 

Here we show that both finite blocklength converse and achievability bounds for the qubit dephasing 
channel can be inherited from the corresponding bounds in [5] for the classical binary symmetric channel 
(BSC). The dephasing channel Jf B \ A has Kraus operators — pi and y/pcr z , and the Choi operator is 
N b | A = 2(1 — p)4>^g + 2p$f B . Here 4> + is the canonical maximally entangled state, i.e. <F + = <f>, and 
<F“ = (1 < 8 > cr z )<f>(l ( 8 ) cr z ). Since the Bell states are orthogonal, dephasing is essentially the BSC for phase 
errors. 

Clearly JF B ^ is covariant under the action of cr z ; indeed, it is covariant under any unitary operator 
diagonal in the dephasing basis. Since cr x cr z cr x = —cr z , it is covariant under the action of cr x as well. For a 
single qubit the relevant symmetry group is G 1 = {1,ij x ,ct y ,cr z } (up to phases, which are irrelevant since 
the group acts by conjugation by the Pauli operators). The corresponding C S\ on the input space has the 
action c S 1 {p) = n for all p, where n is the maximally mixed state. 

The memoryless extension inherits these symmetries in each input space, so G n = 

G 1 x G 1 x • • • x G 1 . Similarly, c S n {p n ) = n n has the effect of completely depolarizing the input state. That 
is, the optimal input state p A n for the bound is maximally mixed. Thus, 

FG (^, e) = ‘ 0 - A A n B n < ^t A n B n ,Fr[A A n B nN B ^] > 1 — ff}. (59) 

Furthermore, we can restrict A A n B n to be in the support of IV®” without loss of generality, since feasibility 
will not be affected and the objective function in the inner minimization of the maximin bound (26) can 
only decrease. Staying with the maximin bound, we may choose Ji B n^n to have Choi state M®" with 
M B \ A = i- e - fully dephasing channel. Defining L A n B n = 2 n A A n B n, (26) then yields 

h > minimize Tr[L A n B n^Z BlA (<i> AB )® n ] 

L A n B n 

subject to Tr[L A ng„A^| A (<F 4 g)®' 1 ] >l-e, ( 6 °) 

0 < L A n B n < l A n B n. 


9 



Letting co^ = ^| A Oab) = (1 ~P)®ab +P^ab and g ab = ^b\a($ab) = ^Ib + ^ab^ the righthand side is 
just the minimal type-II error of distinguishing a>fjj from oyjg, for type-I error constrained to be no larger 
than e. That is, 


1 

M 


>/?i s^m’^abJ- 


®n (gui-i 


(61) 


Since both states are diagonal in the same basis, the hypothesis test between co^g and cr^g can be 
recast as a test between classical distributions. Observe that measuring each system of co^ and u AH in the 
basis of a x produces the probability distributions P XY and P x Q y , respectively, with P x and Q Y uniformly 
distributed and P[X = Y] = 1 — p. Moreover, we can reconstruct the original states co AB and u AB from P XY 
and P x Qy with the map that sends (X, Y) to when X = Y and to T otherwise. Therefore we have 


— >/3i_ 

M 1 1 


fpxn p > 

VXY ’ r X 


"Qy")- 


(62) 


This bound is precisely the expression obtained for the binary symmetric channel by Polyanskiy, Poor, 
and Verdu [5, Theorem 26] (see also [7, Theorem 22]). Thus, the dephasing channel inherits the finite 
blocklength converse of the BSC, and the bounds are identical if our choice for ^ B n\ A „ is optimal. 

Regardless of the optimality of ^Z B n\ A n, asymptotic results such as the strong converse and second 
order coding rate for the dephasing channel follow directly from the classical problem, in particular Eq. 
160 and Theorem 52 in [5], respectively. Alternately, one can deduce the strong converse property more 
immediately by simply invoking Stein’s lemma on (61) or (62). (Note that the strong converse was first 
shown in [18].) 

Finally, the achievable bounds for dephasing are also at least as good as those of the classical BSC, 
simply because any classical code for the BSC can be regarded as correcting phase errors and applied to 
the dephasing channel. More specifically, the classical code can be used as part of a CSS-like quantum 
code, as described in [19]. The “error correction” part of the code (see the Remark prior to §V) is just 
the code for the BSC, applied to the cr x basis (i.e. with inputs diagonal in this basis). We can dispense 
with the “privacy amplification” part of the code, since it may be easily verified that the complement of 
the dephasing channel has a constant output on inputs diagonal in the cr x basis. We require a CSS-like 
code and not a proper CSS code because the classical BSC code need not be a linear code. Note also that, 
importantly, the guessing probability of the classical code is equal to the fidelity of the quantum code for 
this channel. For a more detailed and convincing discussion of this point, see [2], 


6.2 Erasure channel 

For the qubit erasure channel we can inherit a converse bound from the metaconverse of the classical 
binary erasure channel (BEC). The qubit erasure channel has qubit input and output B of dimension three, 
namely B = A© C. The extra dimension indicates to the receiver that the input was erased. The Choi state 
of the erasure channel with probability p is simply N B ^ = 2(1 — p)$ 4 g + 2pn A <g> |e)(e| B , where |e) is the 
additional vector in B. The channel is covariant with respect to action by any unitary on the input, with 
corresponding inverse on the output, plus dephasing of the output into the A and \e) subspaces. Therefore, 
the optimal input state is the maximally mixed state. Let co be the output of the n-fold application of 
J/' B \ A and let cr A n Bn be the output of the PPT map Jl B n \A n • 

As with the dephasing channel, consider a measurement of A n and B n in the standard basis and call 
the output random variables X n and Y n , respectively. From we obtain the distribution P XY , with P x 
uniform and Y = X with probability 1 — p and equal to e with probability p. Note that we can recover 
co A g from P XY by employing the map which produces n A ® \e)(e\ B when Y = e and otherwise $ AB - Now 
let cr A n B n be the state obtained by this map for the distribution p x n Q Y n with Qyn the optimal choice for the 
metaconverse of the classical BEC, Eq. 168 of [7]. Due to the product form of the classical distribution, 
<J A n B n can be obtained from a PPT channel acting on the maximally entangled state. Thus, we have 
I3\- e (co^, a A n B n) = /)j_ K (P®",P® n Qyn), where the latter quantity appears in the converse bound for the 
classical BEC given in [7]. Therefore, by the minimax bound we obtain 

^>^_ £ (P® y n ,P® n Qyn), (63) 
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meaning the minimax bound for the BEC also applies to the qubit erasure channel with PPT assistance. 
Again, we may infer asymptotic statements such as the strong converse and second order coding rates 
from this bound. 

6.3 Depolarization 

The depolarizing channel has Choi state N B \ A = 2(1 — + ^ab^> where p is the 

probability of depolarization, and T = are obtained from 4> + by conjugation with cr x and cr y , respectively. 
The symmetry group of this channel includes all unitary operations, meaning that the optimal < p A is again 
the maximally mixed state. If we choose M B n\ A n = M®” with M B \ A = + the minimax 

bound involves the optimal hypothesis test between n copies of co^g = (1— + + + and 

n copies of cr^g = As in the case of dephasing, we can convert the hypothesis 

test between co AB and a AB into a test between classical distributions, in fact precisely those distributions 
which were used in the dephasing example. This follows by considering the map which generates 4> + 
when X = Y and otherwise randomly generates one of the other Bell states when X j- Y. Therefore, we 
obtain the same bound, (62), for depolarization as for dephasing. 

This raises the question of whether PPT assistance can turn the depolarizing channel into the dephasing 
channel. To investigate this further, a sensible first step would be to establish optimality of the two bounds, 
to ensure they are truly equivalent. 


7 Discussion 


We have derived a minimax bound for the size of a PPT-assisted quantum code given a target entanglement 
fidelity, very much along the lines of the classical bound by Polyanskiy, Poor, and Verdu [5]. The restriction 
to PPT-assistance comes from the use of Rains’s bound, Lemma 1, though in principle, the bound applies to 
all channels ^i B \ A which deliver a state having overlap 1/M with the maximally entangled state. Focussing 
on PPT has the advantage that the PPT conditions can be phrased as linear constraints, and lead to a 
semidefinite program formulation of the bound. 

It would be desirable to incorporate the PPT constraint into the feasible set f(M / , e) itself, so as to 
tighten the bound. Additional constraints can indeed be found along the lines of Leung and Matthews [9]. 
However, it appears to be impossible to incorporate such additional constraints on f(^Y, e) and obtain a 
bound, like that of Theorem 1, in which we optimize the code size for fixed target fidelity. The difficulty is 
that the further constraints on f(,vV, s) directly involve M, the size of the code. Leung and Matthews avoid 
this problem by optimizing the target fidelity for fixed code size, rather than the other way around. 

The analog of this issue in the classical case is that the metaconverse in [5] also applies to non¬ 
signalling assisted codes [6, 7], not just unassisted codes. There, however, no great gain is to be had by 
adding further constraints to the analog of f{J/',e'), potentially tightening the bound: The 1/M bound is 
already quite tight at moderate blocklengths for channels of interest, as shown in [5], 

One might also hope to obtain a useful bound for unassisted codes by subsituting 1 A 8> <j b for the output 
of the PPT channel ~M B \ A . Despite its normalization, this operator also has the correct 1/M overlap with 
the maximally entangled state. Moreover, it would lead to a hypothesis-testing quantity reminiscent of 
the coherent information, whose regularized version is part of the formula for quantum channel capacity. 
However, the resulting optimization is not a semidefinite program, as now we will have to set M B ^ = 
<p A l ® cr B , leading to < p~ l appearing in the objective function along with A ab . Concretely, we obtain 


1 


— > min 


M ^A>AiB eF (yK ,s) 


max r Tr\_K AB y A 1 <8 )u B ] 

g B 


min max Tr[r^gl A ® cr B ], 

, f ) a B 


(64) 

(65) 


where f '{Jf, e) consists of normalized states tp A and T^g with 0 < T^g < l^g such that Trfr^gipj^A/gi^i/r^ 2 ] > 
1 — e. Furthermore, the symmetry arguments employed in Proposition 3 no longer go through, making 
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the resulting bound difficult to work with. This difficulty is perhaps to be expected, since otherwise sym- 
metrization might lead to a single-letter bound in terms of the coherent information for general channels, 
which is known to be false. 

The same difficulty applies to formulating an SDP-based bound using any PPT state, not just the output 
of a PPT channel on the input ip A as done in Theorem 1. This is the approach taken by Tomamichel 
and Berta to obtain second-order coding rates for simple channels in [1]. In the notation of this paper, 
their bound can be expressed as 


min max Trfr^gcr^] < —, 

ppt M 


( 66 ) 


with the same f' as in the previous paragraph. (Actually, we have interchanged a minimization over T 
with the maximization over cr, but this is permissible by the minimax theorem.) Nevertheless, symmetry 
arguments do go through in this case, and can be employed to infer that optimal input states ip A can 
be chosen to be invariant under the channel symmetry group. For highly symmetric channels such qubit 
dephasing, depolarization, and erasure, this fixes ip A to the mixed state. Then the nonlinearities of the 
optimization disappear and the resulting bound is identical to the minimax bound (24). Indeed, for these 
channels, one could use the results of §6 to more quickly obtain their results regarding second-order 
coding rates. The form of the bound also implies that the minimax bound can be obtained by loosening 
(66), restricting the optimization of a AB to states of the form a AF = (p A M B \ A ip A for some PPT channel 
M B \ A . Equivalently, cr AB must be PPT and satisfy Tr F [<J AF ] = y A . 
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